Cluster discovery in spatial data mining: a variable resolution approach

نویسنده

  • A. J. Brimicombe
چکیده

Spatial data mining seeks to discover meaningful patterns from data where a key dimension of the data is geographical location. This spatial dimension becomes important when data either refer to specific locations andJor have significant spatial dependence and which needs to be taken into consideration if meaningfid patterns are to emerge. For point data there are two main groups of approaches. One stems from traditional statistical techniques such as k-means clustering in which every point is assigned to a spatial grouping and results in a spatial segmentation. The segmentation has k sub-regions, is usually space filling and non-overlapping (i.e. a tessellation) in which all points fall within a spatial segment. The difficulty with this approach is in defining k centroid locations at the outset of any data mining. The other broad approach searches for ‘hotspots’ which can be loosely defined as a localised excess of some incidence rate. In this approach not all points are necessarily assigned to clusters. It is the mainstay of those approaches which seek to identify any significantly elevated risk above what might be expected from an at-risk background population. Definition of the population at risk is clearly critical and in some data mining applications is not possible at the outset. This paper presents a novel variable resolution approach to cluster discovery which acts in the first instance to define spatial concentrations in the absence of population at risk. The cluster centroids are then used to establish initial centroids for techniques such as k-means clustering and arrive at a segmentation on the basis of point attributes. The variable resolution technique can thus be viewed as a bridge between the two broad approaches towards knowledge discovery in mining point data sets. The technique is equally applicable to the mining of business, crime, health and environmental data. A business-oriented case study is presented here. © 2002 WIT Press, Ashurst Lodge, Southampton, SO40 7AA, UK. All rights reserved. Web: www.witpress.com Email [email protected] Paper from: Data Mining III, A Zanasi, CA Brebbia, NFF Ebecken & P Melli (Editors). ISBN 1-85312-925-9

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spatial data mining and geographic knowledge discovery - An introduction

Voluminous geographic data have been, and continue to be, collected with modern data acquisition techniques such as global positioning systems (GPS), high-resolution remote sensing, location-aware services and surveys, and internet-based volunteered geographic information. There is an urgent need for effective and efficient methods to extract unknown and unexpected information from spatial data...

متن کامل

Expert Discovery: A web mining approach

Expert discovery is a quest in search of finding an answer to a question: “Who is the best expert of a specific subject in a particular domain within peculiar array of parameters?” Expert with domain knowledge in any field is crucial for consulting in industry, academia and scientific community. Aim of this study is to address the issues for expert-finding task in real-world community. Collabor...

متن کامل

An Integrated Approach for Regional Association Rule Mining and Scoping

A special challenge for spatial data mining is that information is not spread uniformly in spatial data sets. Consequently, the discovery of regional knowledge is of fundamental importance. However, traditional data mining techniques are ill-prepared for discovering regional knowledge. This paper introduces a methodology for mining spatial association rules and proposes novel algorithms to dete...

متن کامل

A Multi-Objective Approach to Fuzzy Clustering using ITLBO Algorithm

Data clustering is one of the most important areas of research in data mining and knowledge discovery. Recent research in this area has shown that the best clustering results can be achieved using multi-objective methods. In other words, assuming more than one criterion as objective functions for clustering data can measurably increase the quality of clustering. In this study, a model with two ...

متن کامل

Discovering spatial associations in images

In this paper, our focus in data mining is concerned with the discovery of spatial associations within images. Our work concentrates on the problem of nding associations between visual content in large image databases. Discovering association rules has been the focus of many studies in the last few years. However, for multimedia data such as images or video frames, the algorithms proposed in th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003